Multi-channel opportunistic access : a restless multi-armed bandit perspective. (Accès opportuniste dans les systèmes de communication multi-canaux : une perspective du problème de bandit-manchot)

نویسنده

  • Kehao Wang
چکیده

Cognitive radio, first envisioned by Mitola, is the key enabling technology for future generations of wireless systems that addresses critical challenges in spectrum efficiency, interference management, and coexistence of heterogeneous networks. The core concept in cognitive radio networks is opportunistic spectrum access, whose objective is to solve the imbalance between spectrum scarcity and spectrum under-utilization. In the thesis, we address the fundamental problem of opportunistic spectrum access in a multi-channel communication system. Specifically, we consider a communication system in which a user has access to multiple channels, but is limited to sensing and transmitting only on part of them at a given time. We explore how the smart user should exploit past observations and the knowledge of the stochastic properties of these channels to maximize its transmission rate by switching channels opportunistically. Formally, we provide a generic analysis on the opportunistic spectrum access problem by casting the problem into the restless multi-armed bandit (RMAB) problem, one of the most well-known generalizations of the classic multi-armed bandit (MAB) problem, which is of fundamental importance in stochastic decision theory. Despite the significant research efforts in the field, the RMAB problem in its generic form still remains open. Until today, very little result is reported on the structure of the optimal policy. Obtaining the optimal policy for a general RMAB problem is often intractable due to the exponential computation complexity. Hence, a natural alternative is to seek a simple myopic policy maximizing the short-term reward. We start by conducting a generic analysis in Chapter 3 on the optimality of the myopic sensing policy where the user senses more than one channel each time and gets one unit of reward if at least one of the sensed channels is in the good state. Through mathematical analysis, we show that the myopic sensing policy is optimal only for a small subset of cases where the user is allowed to sense two channels each slot. In the general case, we give counterexamples to v te l-0 08 32 56 9, v er si on 1 11 J un 2 01 3

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cooperation control in Parallel SAT Solving: a Multi-armed Bandit Approach

In recent years, Parallel SAT solvers have leveraged with the so called Parallel Portfolio architecture. In this setting, a collection of independent ConflictDirected Clause Learning (CDCL) algorithms compete and cooperate through Clause Sharing. However, when the number of cores increases, systematic clause sharing between CDCLs can slow down the search performance. Previous work has shown how...

متن کامل

Bandits Warm-up Cold Recommender Systems

We address the cold start problem in recommendation systems assuming no contextual information is available neither about users, nor items. We consider the case in which we only have access to a set of ratings of items by users. Most of the existing works consider a batch setting, and use cross-validation to tune parameters. The classical method consists in minimizing the root mean square error...

متن کامل

An Analysis of Some Multiuser Detectors in Impulsive Noise

L’object de cet article est l’étude de la détection de données à travers des canaux de communication non gaussiens, en séquence directe, spectre large et accès multiple. Ce problème apparaît dans des situations pratiques parce que la plupart des canaux physiques, dans lesquels les communications en multiple accès sont utilisées, sont connus pour être incontestablement non gaussiens. Concernant ...

متن کامل

On Optimality of Myopic Policy for Restless Multi-armed Bandit Problem with Non i.i.d. Arms and Imperfect Detection

We consider the channel access problem in a multi-channel opportunistic communication system with imperfect channel sensing, where the state of each channel evolves as a non independent and identically distributed Markov process. This problem can be cast into a restless multi-armed bandit (RMAB) problem that is intractable for its exponential computation complexity. A natural alternative is to ...

متن کامل

A Restless Bandit Formulation of Multi-channel Opportunistic Access: Indexablity and Index Policy

We focus on an opportunistic communication system consisting of multiple independent channels with time-varying states. With limited sensing, a user can only sense and access a subset of channels and accrue rewards determined by the states of the sensed channels. We formulate the problem of optimal sequential channel selection as a restless multi-armed bandit process, for which a powerful index...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012